Rotation and translation covariant match kernels for image retrieval
Authors
Abstract
Most image encodings achieve orientation invariance by aligning the patches to their dominant orientations, and translation invariance by completely ignoring patch position or by max-pooling. Albeit successful, such choices introduce too much invariance because they do not guarantee that the patches are rotated or translated consistently. In this paper, we propose a geometry-aware aggregation strategy, which jointly encodes the local descriptors together with their patch dominant angle or location. The geometric attributes are encoded in a continuous manner by leveraging explicit feature maps. Our technique is compatible with the generic match kernel formulation and can be employed along with several popular encoding methods, in particular Bag-of-Words, VLAD and the Fisher vector. The method is further combined with an efficient monomial embedding to provide a codebook-free method that aggregates local descriptors into a single vector representation. Invariance is achieved by efficiently estimating the similarity for multiple rotations or translations, which is given by a simple trigonometric polynomial. This strategy is effective for image search, as shown by experiments performed on standard benchmarks for image and particular object retrieval, namely Holidays and Oxford buildings.
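The idea of encoding an angle with an explicit feature map and then scoring many rotations through a trigonometric polynomial can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names, the Fourier coefficients `gammas`, the plain sum aggregation and the toy data are all assumptions made for illustration. The only property relied on is the identity cos(a - b) = cos a cos b + sin a sin b, which turns a kernel of the form sum_i gamma_i cos(i * dtheta) into an inner product of finite vectors.

```python
import numpy as np

def angle_feature_map(theta, gammas):
    """Map an angle to a vector a(theta) such that
    a(t1) @ a(t2) == sum_i gammas[i] * cos(i * (t1 - t2)).
    `gammas` are non-negative coefficients (hypothetical values here)."""
    gammas = np.asarray(gammas, dtype=float)
    freqs = np.arange(1, len(gammas))
    amp = np.sqrt(gammas[1:])
    return np.concatenate(([np.sqrt(gammas[0])],
                           amp * np.cos(freqs * theta),
                           amp * np.sin(freqs * theta)))

def aggregate(descriptors, angles, gammas):
    """Sum, over patches, of the Kronecker product of the angle map
    with the local descriptor (one joint vector per image)."""
    return sum(np.kron(angle_feature_map(t, gammas), d)
               for d, t in zip(descriptors, angles))

def rotation_similarity(X, Y, n_freq, deltas):
    """Similarity between image X rotated by each angle in `deltas`
    and image Y, evaluated as a trigonometric polynomial in delta."""
    Xb = X.reshape(2 * n_freq + 1, -1)
    Yb = Y.reshape(2 * n_freq + 1, -1)
    Xc, Xs = Xb[1:n_freq + 1], Xb[n_freq + 1:]
    Yc, Ys = Yb[1:n_freq + 1], Yb[n_freq + 1:]
    c0 = Xb[0] @ Yb[0]                           # constant term
    a = np.sum(Xc * Yc + Xs * Ys, axis=1)        # cosine coefficients
    b = np.sum(Xs * Yc - Xc * Ys, axis=1)        # sine coefficients
    phase = np.outer(np.asarray(deltas, dtype=float),
                     np.arange(1, n_freq + 1))
    return c0 + np.cos(phase) @ a - np.sin(phase) @ b

# Toy usage with random descriptors and angles (illustrative data only).
rng = np.random.default_rng(0)
gammas = [1.0, 0.8, 0.4, 0.1]                    # hypothetical coefficients
desc_x, ang_x = rng.normal(size=(5, 8)), rng.uniform(0, 2 * np.pi, 5)
desc_y, ang_y = rng.normal(size=(6, 8)), rng.uniform(0, 2 * np.pi, 6)
X = aggregate(desc_x, ang_x, gammas)
Y = aggregate(desc_y, ang_y, gammas)
deltas = np.linspace(0, 2 * np.pi, 16, endpoint=False)
scores = rotation_similarity(X, Y, len(gammas) - 1, deltas)
best_delta = deltas[np.argmax(scores)]
```

In this sketch the per-frequency coefficients `a` and `b` are computed once per image pair, so scoring any number of candidate rotations only costs a few cosine and sine evaluations rather than a new aggregation per rotation.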
Similar resources
Illumination color covariant locale-based visual object retrieval
Search by Object Model — finding an object inside a target image — is a desirable and yet difficult mechanism for querying multimedia data. An added difficulty is that objects can be photographed under different lighting conditions. While human vision has color constancy, an invariant processing, presumably, here we seek only covariant processing and look to recover such lighting change. Making...
A new shape retrieval method using the Group delay of the Fourier descriptors
In this paper, we introduce a new way to analyze shape using a new Fourier-based descriptor, which is the smoothed derivative of the phase of the Fourier descriptors. It is extracted from the complex boundary of the shape, and is called the smoothed group delay (SGD). The use of SGD on the Fourier phase descriptors allows a compact representation of the shape boundaries which is robust ...
Shape-Based Image Retrieval Applied to Trademark Images
We propose a new shape-based, query-by-example, image database retrieval method that is able to match a query image to one of the images in the database, based on a whole or partial match. The proposed method has two key components: the architecture of the retrieval and the features used. Both play a role in the overall retrieval efficacy. The proposed architecture is based on the analysis of c...
Shape-Based Image Retrieval Using Shape Matrix
Retrieving images by shape similarity, given a template shape, is particularly challenging, owing to the difficulty of deriving a similarity measure that closely conforms to the common human perception of similarity. In this paper, a new method for the representation and comparison of shapes is presented, which is based on the shape matrix and snake model. It is scaling, rotation, translation ...
Rotation Invariant Texture Image Retrieval Based on Log-Polar and NSCT
In order to solve the problem of rotation invariant texture image retrieval, an image retrieval algorithm based on Log-Polar and nonsubsampled contourlet transform (NSCT) is proposed. Log-Polar transform was first applied to texture image to convert the rotation to translation. Then, translation invariant NSCT was employed to decompose the transformed images. Standard deviations, energies and e...
Journal title:
- Computer Vision and Image Understanding
Volume 140
Publication date: 2015